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Abstract 

Multilevel  (ML)  algorithms  for  eigenvalue  problems  are  often  faced  with  several  types 
of  difficulties  such  as:  the  mixing  of  approximated  eigenvectors  by  the  solution  process,  the 
approximation  of  incomplete  clusters  of  eigenvectors,  the  poor  representation  of  solution  on 
coarse  levels  and  the  existence  of  close  or  equal  eigenvalues.  Algorithms  that  do  not  treat 
appropriately  these  difficulties  usually  fail,  or  their  performance  degrades  when  facing  them. 
These  issues  motivated  the  development  of  a  robust  adaptive  ML  algorithm  which  treats 
these  difficulties,  for  the  calculation  of  a  few  eigenvectors  and  their  corresponding  eigenval¬ 
ues,  presented  in  this  paper.  The  main  techniques  used  in  the  new  algorithm  include:  the 
adaptive  completion  and  separation  of  the  relevant  clusters  on  different  levels,  the  simul¬ 
taneous  treatment  of  solutions  within  each  cluster,  and  the  robustness  tests  which  monitor 
the  algorithm’s  efficiency  and  convergence.  The  eigenvectors’  separation  efficiency  is  based 
on  a  new  ML  projection  technique  generalizing  the  Rayleigh  Ritz  projection,  combined  with 
a  novel  technique,  the  backrotations.  These  separation  techniques,  when  combined  with  an 
FMG  formulation,  in  many  cases  lead  to  algorithms  of  0{qN)  complexity,  for  q  eigenvectors 
of  size  N  on  the  finest  level.  Previously  developed  ML  algorithms  are  less  focused  on  the 
mentioned  difficulties.  Moreover,  algorithms  which  employ  fine  level  separation  techniques 
are  of  0{q^N)  complexity  and  usually  do  not  overcome  all  these  difficulties.  Computationsd 
examples  are  presented  where  Schrodinger  type  eigenvalue  problems  in  2-D  and  3-D,  hav¬ 
ing  equal  and  closely  clustered  eigenvalues,  are  solved  with  the  efficiency  of  the  Poisson 
inultigrid  solver.  A  second  order  approximation  is  obtained  in  0{qN)  work,  where  the  total 
computational  work  is  equivalent  to  only  a  few  fine  level  relaxations  per  eigenvector. 
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Aeronautics  and  Space  Administration  under  NASA  Contract  No.  NASI- 19480  and  NASI- 1860.5 
while  the  author  was  in  residence  at  ICASE,  NASA  Langley  Research  Center,  Hampton,  Va  23681 
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1  Introduction 


Large  scale  eigenvalue  problems  (EP)  arising  from  physics,  chemistry  and  engineering 
often  have  special  features  which  are  not  always  exploited  by  eigenvalue  solvers,  such 
as:  the  EP  can  be  approximated  on  several  levels;  only  few  eigenvalues  and  eigenvec¬ 
tors  are  sought.  These  features  can  be  exploited  naturally  by  multilevel  (ML)  solvers 
as  has  been  done  successfully  by  several  authors  (see  for  example  [3]  and  [12]).  Such 
solvers  generally  involve  the  discretization  of  the  problem  on  a  sequence  of  levels, 
relaxations  employed  on  all  levels,  transfers  of  solutions  and  residuals  from  fine  to 
coarse  levels,  interpolation  of  correction  from  coarse  to  fine  levels,  and  often  a  fine 
level  eigenvector  separation  technique. 

Multilevel  methods  for  EP  sometimes  encounter  difficulties  which  make  their  ro¬ 


bustness  and  efficiency  questionable.  Often  such  difficulties  are  due  to  several  reasons 
which  we  classify  as:  (i)  mixing  of  eigenvectors  by  the  used  procedures,  (ii)  incom¬ 
pleteness  of  a  treated  cluster,  (iii)  incompatible  fine-coarse  level  representation  of  a 
cluster.  Procedures  such  as  relaxations,  transfers  and  ML  cycles  can  introduce  or  am¬ 
plify  an  eigenvector  in  the  error  component  of  an  approximated  eigenvector.  This  we 
refer  as  eigenvector  mixing.  For  example,  it  is  known  that  ML  procedures  mix  Fourier 
components  which  are  the  eigenvectors  of  many  discretized  differential  operators.  If 
the  eigenvectors  of  an  iteration  operator  do  not  coincide  with  the  eigenvectors  of  the 
problem  to  be  solved,  then  the  iteration  may  mix  the  problem’s  eigenvectors.  This  dif¬ 
ficulty  occurs  especially  when  clusters  of  eigenvectors  with  close  or  equal  eigenvalues 
are  approximated.  Usually  it  is  treated  by  simultaneous  separation  techniques,  e.g., 
by  a  Rayleigh  Ritz  type  projection.  However,  if  not  all  eigenvectors  which  are  mixed 
by  the  involved  procedures  are  approximated,  e.g.  the  clusters  are  not  complete,  then 
one  can  expect  the  separations  to  be  inaccurate  and  inefficient.  Thus  a  cluster  will 
be  called  complete  relative  to  a  procedure  if  it  contains  a  whole  set  of  eigenvectors 
which  are  mixed  by  the  procedure.  Difficulties  related  to  incomplete  clusters  may  be 
treated  by  completing  the  clusters  and  processing  them  simultaneously.  The  incom- 
patibilty  of  clusters  representation  on  different  levels  is  another  difficulty  which  has  to 
be  identified  and  taken  care  of.  In  particular,  not  all  clusters  can  be  approximated  on 
an  arbitrarily  coarse  level  and  the  eigenvalues  clustering  may  differ  on  various  levels. 
These  difficulties  and  their  remedies  suggest  the  following  conclusions:  (i)  Clusters  of 


eigenvectors  should  be  treated  simultaneously  using  a  separation  technique  like  the _ 

Rayleigh-Ritz  projection,  (ii)  Clusters  have  to  be  completed,  (III)  Different  clusters  !'’or 
should  be  differently  treated  on  different  levels.  This  suggests  that  previously  de- ;  ' 

veloped  algorithms  may  fail  in  some  standard  situations.  These  include  nonadaptive 
algorithms,  algorithms  which  treat  the  eigenvectors  sequencially  in  clusters,  which  do 
not  complete  the  relevant  clusters,  or  which  do  not  take  into  account  the  inter-level  ~ 
eigenvector  mixing.  Even  in  cases  when  such  algorithms  work,  their  efficiency  may  be 
improved  by  an  zidaptive  treatment  or  by  coarse  level  separation  techniques  as  used  ~~ 

in  this  work.  ^  m  ^ 
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This  paper  focuses  on  a  more  robust  and  efficient  algorithm  for  the  calculation  of 
a  few  eigenvalues  and  their  corresponding  eigenvectors.  Its  development  was  guided 
by  the  above  mentioned  difficulties  and  their  remedies.  Beside  its  robustness,  the 
algorithm  achieves  a  better  computational  complexity  than  previously  known  ML 
eigenvalue  algorithms  which  use  fine  level  projections.  The  robustness  of  the  present 
approach  is  based  on  the  adaptive  completion  and  separation  of  the  relevant  clusters 
on  different  levels;  the  simultaneous  treatment  of  solutions  for  each  cluster;  and  ro¬ 
bustness  tests  which  monitor  the  algorithm’s  convergence  and  efficiency.  A  central 
efficiency  feature  of  the  algorithm  presented  here  results  from  the  newly  developed 
Generalized  Rayleigh  Ritz  (GRR)  projection  and  backrotation  (BR),  which  employ 
the  projection  on  coarse  levels,  adaptively  with  the  cluster  involved.  This  reduces  in 
many  cases  the  most  time  consuming  part  of  the  algorithms,  namely,  the  fine 

level  separation  work,  to  0{qN)  for  q  eigenvectors  of  size  N  on  finest  level. 

These  ideas  are  combined  in  an  FMG  algorithm  which  first  solves  the  problem 
on  coarse  levels,  then  interpolates  the  solutions  to  finer  levels  where  they  serve  as 
an  initial  approximation  to  the  corresponding  finer  level  problems.  On  the  currently 
finest  level,  clusters  are  identified  and  tested  for  completeness,  completed  if  necessary, 
and  improved  by  ML  cycles  using  coarser  levels.  The  eigenvalue  equations  is  relaxed 
on  each  level  followed  by  FAS  transfers.  Generalized  Rayleigh  Ritz  projection  and 
backrotations  (GRR-BR)  are  employed  on  coarse  levels  usually,  in  order  to  separate 
eigenvectors  within  their  clusters,  and  to  keep  the  coarse  level  representation  of  the 
solutions  as  close  as  possible  to  the  fine  level  solutions.  This  is  done  adaptively  for 
different  clusters  on  appropriate  levels.  On  the  level  on  which  the  algorithm  starts, 
only  a  part  of  the  sought  eigenvectors  are  approximated  usually,  and  more  eigenvectors 
with  their  corresponding  eigenvalues  are  added  on  finer  levels. 

In  the  examples  presented  here  for  the  Schrodinger  eigenvalue  problem  in  2-D 
and  3-D,  two  to  four  fine  level  relaxations  per  eigenvector  were  performed.  Equal 
eigenvalues  were  calculated  with  more  than  ten  decimal  places  and  accurate  results 
were  obtained  for  very  close  eigenvalues  as  well.  A  second  order  approximation  is 
obtained  by  1-FMG-V(1,1)  in  0{qN)  work,  for  q  eigenvectors  of  size  N  on  the  finest 
level. 

The  present  approach  can  be  extended  to  nonlinear  eigenvalue  problems,  an  ex¬ 
ample  being  presented  in  Costiner  and  Ta’asan  [6]. 

We  refer  to  the  early  works  of  Hackbusch  [9]  McCormick  [17]  ,  Bank  [1]  for  theory 
on  multilevel  eigenvalue  solvers  and  first  algorithms.  A  sequencial  ML  algorithm  for 
linear  eigenvalue  problems  performing  the  projection  on  fine  levels  is  presented  in 
Brandt  McCormick  and  Ruge  [3].  More  theory  and  algorithms  on  ML  EP  may  be 
found  in  Hackbusch  [12].  Zaslavski  in  [23],  [24],  uses  an  adaptive  algebraic  correction 
scheme  cycle  to  compute  the  first  eigenvector  and  its  eigenvalue  for  the  multigroup 
neutron  diffusion  equation.  The  elements  of  such  a  ML  cycle,  modified  to  the  FAS 
form,  can  be  used  in  the  algorithm  presented  here.  Our  approach  differs  from  previous 
ML  approaches  mainly  by  the  following  issues:  emphasis  on  robustness,  simultaneous 
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cluster  processing,  cluster  completion,  ML  separation  by  GRR  and  BR,  treatment  of 
eigenvector  mixing,  treatment  of  close  and  equal  eigenvalues.  The  ML  projection  idea 
was  first  introduced  by  Ta’asan  [21];  backrotations  were  introduced  in  Costiner  [5]. 
The  combination  of  our  technique  with  domain  decomposition  techniques  is  natural 
but  was  not  analysed  yet.  For  some  domain  decomposition  techniques  for  eigenvalue 
problems  see  for  example  Bourquin  and  d’Hennezel  [2],  and  Luo  [14].  A  review  article 
on  single  level  large-scale  complex  eigenvalue  problems,  containing  many  references,  is 
Kerner  [13].  For  a  theory  on  Ritz  projections  and  on  algebraic  eigenvalue  problems 
we  refer  to  Parlett  [19],  Wilkinson  [22],  Golub  and  Van  Loan  [8].  The  single 
level  technique  to  obtain  the  eigenvectors  by  relaxations  and  projections  is  refered  in 
different  places  as  subspace,  simultaneous  or  Ritz  iterations.  We  refer  to  Nikolai  [18], 
Rutishauser  [20]  and  McCormick  [17]  for  a  single  level  algorithm  and  mathematical 
foundations. 

The  paper  is  organized  as  follows.  Section  2  presents  the  GRR  ML  projection,  the 
backrotations  and  the  multilevel  cycle.  Section  3  presents  and  discusses  the  adaptive 
techniques  such  as  the  robustness  tests,  the  cluster  completion,  and  the  adaptive 
FMG.  Section  4  presents  computational  examples. 

2  Multilevel  Projection  Techniques 

One  of  the  key  elements  of  our  algorithm  is  the  ML  projection  used  to  separate 
eigenvectors  corresponding  to  closely  clustered  and  equal  eigenvalues.  To  motivate 
the  new  projection  method  consider  a  fine  level  problem 

AhOii  —  DhA  =  0  (2.1) 

where  A  =  diag{Xi, . . . ,  X,,)  contains  on  diagonal  the  q  sought  eigenvalues  corre¬ 
sponding  to  the  desired  eigenvectors  which  are  the  columns  of  0^-  Assume  that  Uh 
consists  of  linearly  independent  combinations  of  eigenvectors  belonging  to  the  sought 
subspace.  In  this  case,  a  Rayleigh-Ritz  (RR)  projection  provides  a  A  and  a  q  x  q 
invertible  matrix  E  such  that  A  and  =  UhE  are  solutions  of  (2.1).  The  FAS  coarse 
level  equation  after  performing  the  projection  on  the  fine  level  becomes 

AMhE  -  U2hEA  =  T^^E  (2.2) 

where  —  I\^Ah.Uh,  is  the  usual  FAS  right  hand  side.  The  form  of 

the  coarse  level  solution  U2hE  =  I^'^UhE  suggests  that  one  may  obtain  E  and  A  on 
the  coarse  level  from  2.2.  This  can  be  done  by  a  generalized  version  of  the  (RR) 
projection,  presented  next. 
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2.1  Generalized  Rayleigh  Ritz  Projections  and  Backrota- 
tions 

Solutions  {E,\)  of  (2.2)  may  not  exist  if  Uh  is  not  a  basis  of  span{Uh).  However, 
as  in  the  usual  (RR)  projection,  if  Uh  approximates  a  basis  of  span{Uh),  one  finds 
E  and  A  such  that  the  projection  of  the  residual  of  (2.2)  on  span{U2k)  is  minimized, 
i.e.,  solving  a.  q  x  q  generalized  eigenvalue  problem  as: 

iU2hfiA2kU2k  -  T^\)E  =  {U2KfU2hE\  (2.3) 

The  process  of  solving  for  {E,  A)  given  {A2h,  U2hi  T^h)  is  denoted  by 

(£;,  A)  4-  GRR(A2fc,  t/2/v,  T^h)  (2-4) 

The  above  projection  is.  refered  later  on  as  generalized  Rayleigh  Ritz  Projectinn 
(GRR)  or  as  multilevel  ML  projection  when  several  levels  are  involved.  The  GRR 
projection  cannot  be  directly  combined  with  the  usual  FAS  correction  Uh  —  Uh  + 
l2h{U2ii  —  JV^Uh)t  since  it  will  change  an  exact  fine  level  solution  Uh,  e.g.  if  E  is  not 
the  identity  but  rather  a  permutation  matrix.  This  difficulty  can  be  solved  by  using 
a  modified  FAS  correction  such  as 

(/,  =  £/,£  +  -  /f  £/,£)  (2.5) 

Note  that  (  2.5  )  would  lead  to  0{q^N)  operations,  equivalent  to  a  fine  level  projection 
work.  Thus  it  is  desirable  to  replace  (  2.5  )  with  more  efficient  techniques.  Other 
difficulties  may  occur  for  degenerate  subspaces  when  any  matrix  can  serve  as  a  solution 
for  E,  thus,  mixing  and  destroying  orthogonality  of  fine  level  solutions  for  example. 

A  natural  technique  to  fix  these  diflBculties  is  introduced  and  used  in  combination 
with  the  multilevel  projection.  It  is  refered  to  as  backrotation  because  of  its  geo¬ 
metrical  meaning.  As  suggested  by  the  above  discussion,  for  degenerate  subspaces 
the  backrotation  should  produce  block  diagonal  submatrices  in  E  which  are  close 
to  the  identity  matrix  of  the  appropriate  dimension  and  eliminate  permutations  of 
eigenvectors.  A  backrotation  step  will  be  further  denoted  by 

(FJ,  A)  ♦- BackRotation(£^,  A)  (2.6) 

A  particular  backrotation  algorithm  is: 

BackRotation 
Input  {E,\) 

1)  Sort  the  eigenvalues  of  A  and 

permute  the  columns  of  E  accordingly 

2)  Determine  the  clusters  of  eigenvalues  of  A 

to  be  considered  degenerate,  and 

determine  the  clusters  to  be  considered  nondegenerate 
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3)  For  each  diagonal  block  in  E 

associated  with  a  nondegenerate  cluster  do: 

Bring  to  the  diagonal  the  dominant  elements  of  the  block  by 

permuting  the  columns  of  F, 

and  correspondingly  the  diagonal  of  A. 

4)  Let  F  be  a  block  diagonal  matrix 

whose  diagonal  blocks  are  the  diagonal  blocks  of  F, 
corresponding  to  the  clusters. 

Replace  each  diagonal  block  which  does  not  correspond 
to  a  degenerate  cluster  by  the  corresponding  identity  matrix 

5)  Set  E  = 

6)  Change  the  signs  of  columns  of  F 

to  get  positive  elements  on  diagonal. 

7)  Normalize  the  columns  of  F. 

Output  (F,  A) 


2.2  Multilevel  Combined  Cycles 

In  general,  not  all  required  eigenvectors  can  be  well  approximated  on  an  arbitrarily 
coarse  level.  For  example,  fine  level  discretized  Laplace  operator  eigenvectors  corre¬ 
sponding  to  larger  eigenvalues  may  not  be  representable  on  coarse  levels.  Moreover, 
the  eigenvector  cluster  structure  may  differ  on  various  levels.  However,  the  efficient 
solution  of  the  eigenvectors  corresponding  to  small  eigenvalues  may  often  be  done 
using  coarse  levels.  A  major  difficulty  is  that  solutions  belonging  to  a  cluster  are 
often  mixed  by  procedures,  i.e.  the  procedures  may  regard  linear  combinations  of 
solutions  as  a  solution.  This  obstacle  can  be  overcomed  by  simultaneously  treating 
all  the  cluster’s  eigenvectors  and  separating  them  on  different  levels. 

Efficiency  and  convergence  considerations  require  that  the  GRR  projection  should 
be  done  for  different  clusters  on  different  levels.  Moreover,  the  coarsest  level  used  to 
treat  a  given  cluster  may  not  coincide  with  the  level  on  which  the  GRR  projection 
is  done.  Thus,  the  full  algorithm  depends  on  some  parameters  aissociated  with  each 
cluster,  that  determine  the  flow  of  the  algorithm  for  that  cluster.  These  parame¬ 
ters  are  determined  adaptively  during  the  solution  process  (as  explained  in  the  next 
section). 

Following  is  a  description  of  a  basic  ML  cycle  used  in  the  adaptive  algorithm 
presented  in  the  next  section. 

Let  q'  eigenvectors  be  approximated  by  j  clusters  on  level  k: 

Uk^(Ul . Ui)  (2.1) 

where,  each  f/*  approximates  f/J,  the  solution  of 

AuUi  =  Ui\^  +  ’lt  t  =  l,...,j  (2.2) 
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where  each  Ul  is  a  matrix  whose  columns  are  eigenvectors  and  A*  is  a  diagonal  ma¬ 
trix  whose  diagonal  elements  are  the  corresponding  eigenvalues.  Usually,  on  the  finest 
level,  k  =  m,  Tk  =  {T^, . . . ,  T^)  =  (0, . . .  ,0).  Denote  A  =  diag{A^, . . . ,  A-').  For  each 
cluster  Uj.  let  be  the  level  on  which  the  GRR-BR  projection  is  done,  and  the 
coarsest  level  used  in  the  ML  process  for  this  cluster.  Here  it  is  assumed  that  li  <  K- 
Denote  Ip  =  (/’, . . . ,  /^),  Ic  =  (/],..., l^)  and  by  a  function  transfer  from  level  k  to 
level  j.  For  improving  a  given  approximation  {(/„,,  A,  r„i),  a  multilevel  cycle  consist¬ 
ing  in  a  sequence  of  cycles  for  each  cluster  in  turn,  is: 

(f/,„,  A,r,„,/p,/c,7')  CL-MLP  {m,  A„„U,n,  A,T,nJpJc,q') 

For  i  =  1, . . . ,  j  do: 

For  k  =  m, . . . ,  do: 

Repeat  Times: 

If  /•  =  Ik  then  {Ui,  A\  T^)  ^  GRR-BR(m,  Ak,  Ui,  A‘,  Ti,  ,  k) 

U'k  ••-Relax  (m,  Ak,  Ui,  A\  Tl,  k,  /‘) 

If  fc  >  /*  then: 

Set  k  =  k 

'^k  —  ^k+li'^k+l  ~  ^fc+i^fc+i)  +  ^if^k 

End 

For  k  =  ll, . . .  ,m  do: 

If  (*  >  (•)  Correct  (/{  =  (/£  +  ItM;  -  It'Vi) 

Repeat  NjJ  Times 

U'k  <— Relax  (m,  Ak,  Ui,  A’,  Ti,  k,  /’) 

If  /•  =  fc  then  {Ui,  A\  Ti)  ^  GRR-BR(m,  Ak,  Ui,  A\  Ti,  /’ ,  k) 

End 

End 


The  GRR-BR  algorithm  used  above  is  the  following: 

{Ui,A\Ti)  ^  GRR-BR{m,Ak,Ui,A\Ti,li,k) 

Perform 

{E,A')  ^GRRiAk,Ui,Ti) 

{E,A')  BackRotatation(E,  A') 

Ui  =  uiE 

Ti  =  TIE 


Observe  that  in  CL-MLP  the  clusters  are  treated  sequentially  and  within  each 
cluster  the  solutions  are  treated  simultaneously  in  the  ML  cycle.  A  simultaneous 
cycle  for  several  clusters  is  obtained  by  grouping  the  clusters  into  a  single  larger 
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cluster  and  applying  CL-MLP  to  it.  This  may  be  used  to  improve  separation  between 
clusters.  Observe  that  if  for  each  cluster  the  GRR-BR  projection  is  performed  on 
the  finest  level,  the  algorithm  still  requires  less  work  than  an  algorithm  performing 
the  fine  level  projection  for  all  clusters  simultaneously.  Moreover,  if  mixings  occur 
on  coarse  levels,  one  may  expect  an  algorithm  using  fine  level  separation  to  have 
a  poor  efficiency;  a  coarse  level  separation  usually  improves  the  efficiency  in  such 
cases.  For  well  separated  eigenvalues  the  projection  may  not  be  needed  except  at 
initial  coarse  level  stages  of  the  FMG,  later  the  eigenvalues  determine  the  separation 
of  eigenvectors  via  the  multilevel  cycles.  The  same  holds  for  well  separated  clusters 
which  do  not  need  a  simultaneous  separation.  This  is  especially  useful  for  a  larger 
number  of  eigenvectors,  belonging  to  well  separated  clusters  (e.g.  already  for  10 
eigenvectors  the  improvement  can  be  noticeable).  Accurate  normalization,  if  needed, 
can  be  performed  as  the  last  step  only  on  the  finest  level. 

Several  parameters  in  this  algorithm  have  to  be  specified,  such  as  coarsest  levels 
Ic  and  Ip,  and  numbers  of  iterations.  These  parameters  are  chosen  by  the  adaptive 
algorithm  discussed  in  next  section.  The  choice  of  Ip  and  Ic  depends  on  cluster  and 
stage  of  the  algorithm.  The  number  of  iterations  Nk  can  be  deduced  for  simple  cases 
based  on  Fourier  or  2-level  cycle  analysis  or  in  general  cases,  by  robustness  tests 
performed  during  execution.  The  number  of  relaxations  can  vary  with  level.  In  the 
computational  tests  one  or  two  relaxations  per  fine  level  passing  were  performed. 

sought 


3  Adaptive  Multilevel  Algorithms 

For  robustness,  the  construction  of  an  adaptive  version  of  the  CL-MLP  is  essential 
since  the  grouping  of  eigenvectors  into  clusters,  the  sizes  of  the  clusters,  the  coarsest 
level  corresponding  to  a  given  cluster  are  not  known  in  advance,  usually.  Eigenvectors 
belonging  to  clusters  usually  are  mixed  by  different  procedures.  This  deteriorates  the 
algorithms’  efficiency  and  often  prevents  convergence.  A  typical  difficulty  occurs 
when  a  procedure  approximates  only  several  eigenvectors  of  a  cluster.  Then  the 
non  approximated  eigenvectors  usually  are  the  dominant  components  of  the  errors 
which  are  hard  to  eliminate  due  to  mixing.  This  suggests  to  complete  the  clusters  and 
separate  the  solutions  within  clusters  whenever  necessary.  Simultaneous  techniques 
treating  at  a  time  all  solutions  belonging  to  a  complete  cluster,  can  be  eeisier  coupled 
with  separation  techniques  at  different  stages,  thus  acquiring  better  efficiency  than 
sequencial  techniques  which  hardly  avoid  difficulties  due  to  mixing. 

In  the  adaptive  algorithm,  the  clusters  are  tested  for  completion  and  completed. 
The  cluster  completion  is  tested  on  all  current  finest  levels  and  performed  on  several 
levels  since  the  structure  of  clusters  can  differ  on  different  levels. 

The  full  multilevel  solver  described  below  starts  on  coarse  levels,  and  solves  there 
for  as  many  eigenvectors  as  possible.  Then  it  uses  those  as  an  initial  approximation 
for  finer  level  solutions  where  more  eigenvectors  are  eidded  if  needed. 
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Two  important  parts  of  this  algorithm  are  the  completion  and  addition  of  clusters. 
The  completion  of  a  cluster  is  done  by  adding  in  turn  a  new  vector  and  improving  it 
by  multilevel  cycles.  An  approximate  eigenvalue  is  associated  with  this  eigenvector, 
by  a  Rayleigh  quotient.  If  the  eigenvalue  is  close  to  the  cluster  then  the  new  vector 
is  added  to  the  cluster.  If  the  found  eigenvalue  does  not  belong  to  the  cluster  then 
the  cluster  is  considered  complete.  The  convergence  of  the  additional  eigenvector  is 
not  sought.  At  the  end,  the  complete  cluster  is  improved  by  several  CL-MLP  cycles. 

The  addition  of  a  new  clusters  is  usually  done  in  the  first  stages  of  the  algorithm 
when  not  enough  complete  clusters  are  found.  This  is  performed  by  the  cluster 
completion  algorithm  described  next. 

Denote  by  dj  the  current  dimension  of  the  cluster  f/^.  The  cluster  completion  and 
cluster  addition  algorithms  are  given  by: 


(;,  Ul,  AJ, /p,  Ic, q')  <-Complete-Cluster(j,  /J, q') 

Until  (Cluster-Completion-Test  =  TRUE)  Do 
Choose  random  (ft 

Until  <  Ak<j>,<l>  >  f  <  <f>,<f>  >  and  residuals  stabilize  Do: 
T^,  0,  />,  1 )  ^CL-MLP(fc,  <!>,  A^„„,  T’,  0,  P,,  1 ) 
Separate  <j>  from  {Ul,. . . ,  Vl) 

Set  A  =<  Ak<t>y <i>>  I  <  <i>,4>> 

VI  -  (UU) 

A-'  diag{A\  A) 

9'  =  7'  +  1 ,  dj  =  dj  + I 
End 

Perform  ^CL-MLP(t.(/',  A', I', rf,) 


Ui  Vt,  A,  Tt,  Ip,  Ip,  <i')  <-Add-Cluster(j,  Ut,  A,  Tr,  Ip,  <i’) 

Set  j  =  j  +  I 

{h  f-^k^  A-',  /p,  4,7')  ^Complete-Cluster(j,  Ul,  Tl,  /^,  7') 

Setf4  =  (f/^...,f/i),  A  =  (A4...,A0 


The  Separation  of  <f)  from  the  other  eigenvectors  may  be  performed  by  orthogo¬ 
nality,  projection  or  by  a  ML  cycle. 

Another  crucial  part  of  the  algorithm  are  the  robustness-tests.  Robustness  tests 
are  techniques  which  find  parameters  to  be  used  in  a  certain  procedure  for  given 
data  such  that  the  procedure  will  be  convergent  and  will  be  as  efficient  as  possible. 
Typically,  the  procedure  is  tested  over  a  set  of  data,  or  information  from  intermediate 
results  is  used  in  order  to  reduce  the  work  involved  in  testing.  Next,  for  simplicity, 
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we  consider  only  the  robustness  test  which  provides  the  parameters  (/p,/c)  for  the 
CL-MLP  cycle.  This  ensures  the  proper  values  for  the  parameters  IcJp-  in  order 
to  get  convr.^jnce  and  the  best  attainable  efficiency  for  CL-MLP.  This  is  done  by 
examining  Tor  clusters  the  different  possible  values  for  lc,lp-  The  use  of  an  FMG 
algorithm  suggests  that  these  parameters  for  a  given  cluster  will  stabilize  with  the 
refinement  process.  Thus,  a  notion  of  a  stabilized  cluster  is  introduced  in  order  to 
save  unnecessary  work  in  looking  for  new  values  for  Ic,  Ip  for  stabilized  clusters. 

A  complete  cluster  on  level  L  is  called  stabilized  if  it  corresponds  to  a  complete 
cluster  from  level  L—  1  or  L-f  1  in  the  sense  of  the  number  of  eigenvectors  in  the  cluster, 
the  values  of  the  eigenvalues  and  the  eigenvectors  approximation.  For  stabilized 
cluster,  corresponding  to  a  coarser  level  stabilized  cluster,  we  take  the  Ic,  Ip  values 
from  the  corresponding  coarser  level  cluster.  For  not  stabilized  clusters,  which  would 
exist  usually  on  coarse  levels  only,  we  perform  a  search  to  obtain  best  values  for  Ic,  Ip. 
This  is  done  by  performing  ML  cycles  with  different  choices  of  these  parameters,  and 
choosing  the  ones  that  perform  best.  This  is  summarized  in  the  next  algorithm.  Such 
tests  are  inexpensive  when  performed  on  coarse  levels,  and  often  lead  to  significant 
fine  level  work  savings.  Moreover,  such  tests  are  essential  to  ensure  the  algorithm’s 
convergence. 

Convergence  is  always  obtained  since  at  least  the  single  level  cycle  converges,  being 
a  subspace  iteration  algorithm  [8]. 

Denote  by  (/^,  m)  the  Ip  and  /,  parameters,  for  the  level  m  cycle  for  the  cluster 
j,  and  by  :=  n{CL  -  M  LP{m,  A,n,Vm,  the  convergence 

rate  (measured  by  the  residual  decrease)  of  the  CL-MLP  cycle  for  cluster  j  on  level 
m,  using  the  parameters  {lp,lc).  The  following  algorithm  updates  {lp,lc)  on  level  m: 

{lpJc,m)  *-  Robustness-Test  {m,  A,n,Um,  A,Tm,lp,lc,q') 

For  j  Do: 

If  <  e  ) 

then 

1) 

else 

If(|A'.-AL,l>0  or  if  (A^  )  is  not  approximated 
then 

Solve  for 

min,,  <  m, 

else 

-  1) 

endif 

endif 

Observe  that  since  the  initial  values  for  on  any  level  are  taken  from  the  next 
coarser  level,  the  search  needed  in  the  Robustness- Test  is  over  just  a  few  choices  of 
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the  parameters. 

The  full  ML  algorithm  uses  as  building  blocks  the  CL-MLP,  Add-Cluster,  Complete- 
Cluster  and  Robustness-Test  algorithms  described  before.  It  is  defined  as: 

Adaptive-FMG(7n,  q,  A) 

Set  A:  =  1,  (/'  =  0,  j  =  0,  li  =  k 

Until  (q'  >  q  or  q'  >  adim/t)  Perform 

O',  Uk,  A,  Tjt,  /p,  L,  q')  <-Add-Cluster(j,  Uk,  A,  Tk,  Ip,  h,  q') 

{Uk,  A,  Tk,  Ip,  h,  q')  ^CL-MLP(fc,  Uk,  A,  Tk,  Ip,  h,  q') 

Until  k  >  m  Do: 

If  k  <  rn  then: 

Set  A;  =  A:  -I-  1,  Uk  =  /it_if4-i,  Tk  =  0 

endif 

If  (q‘  >  q)  then: 

If  (Cluster-Completion- Te8t=TRUE  then: 

{Uk,  A,  Tk,  Ip,  h,  q’)  ^CL-MLP(A:,  Uk,  A,  Tk,  Ip,  h,  q') 

Else 

(j,  Ul,\\  Ti,lj„  ll, q')  ^Complete-ClusterO,  A4  7^4  /^,  T^,q') 
{Uk,  A,  Tk,  Ip,  Ic,  q')  ^CL-MLP(A:,  Uk,  A,  Tk,  Ip,  h,  q') 

endif 

Else 

Until  {q'  >q  or  q'  >  adimfc)  Perform 

{j,  Uk,  A,  Tk,  Ip,  Ic,  q')  Add-Cluster(j,  Uk,  A,  Tk,  Ip,  h,  q') 

{Uk,  A,  Tk,  Ip,  Ic,  q')  *-CL-MLP{k,  Uk,  A,  Tk,  Ip,  h,  q') 

endif 


3.1  Storage  and  Complexity 

For  the  Adaptive-FMG  algorithm,  storage  is  required  for  the  q  eigenvectors  of  size  N 
on  the  finest  level,  the  potential  and  the  corresponding  right  hand  sides,  on  all  levels, 
giving  an  overall  estimate  of  memory  of  order  0{3{N  -f-  1))  for  problems  in  2-D  and 
3-D.  The  FMG  work  requires  0{N)  operations  per  eigenvector.  The  work  performed 
on  coarsest  levels  should  be  added  to  these  estimates.  In  the  performed  tests  where 
a  few  eigenvectors  were  sought,  the  coarse  level  work  wcis  usually  a  fraction  of  the 
finest  level  work.  If  accurately  zero  scalar  products  are  needed  on  finest  levels  then 
orthonormalizations  or  projections  may  be  required  within  the  finest  level  degenerate 
or  close  clustered  eigenspaces.  However,  as  can  be  seen  in  the  computational  ex¬ 
amples,  accurate  orthogonality  inside  degenerate  clusters  may  be  obtained  by  coarse 
level  separation  also. 

In  the  computational  examples  presented  here,  a  complexity  of  0{qN)  is  obtained. 
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4  Computational  Examples  and  Discussions 

We  have  chosen  several  examples  in  which  we  focus  on  different  aispects  of  the  method. 
The  first  example  is  of  an  adaptive  algorithm  in  which  we  take  a  case  of  special 
difficulties  in  terms  of  clustering  of  eigenvalues,  and  correspondence  of  these  clusters 
between  levels.  The  second  example  shows  that  it  is  enough  to  treat  the  clusters  in  a 
sequential  manne’’,  and  separation  has  to  be  done  within  each  cluster  only  to  obtain 
good  convergence  and  accurate  separation. 

The  third  example  shows  that  the  new  multilevel  projection  technique  may  be 
performed  on  coarsest  levels,  even  in  cases  with  closely  clustered  or  equal  eigenvalues, 
thus  reducing  the  computational  work  significantly.  A  last  example  shows  that  the 
same  efficiency  is  obtained  for  problems  in  3-D  as  for  problem  in  2-D. 

In  all  examples  the  periodic  boundary  conditions  Schrodinger  eigenvalue  problem 

(A  —  V)u  =  Au  (4.1) 

defined  on  Q  =  [0,0]*^  (d=2  or  3)  where  a  =  2n/\0,  v/as  considered.  The  potentials 
are  chosen  such  that  the  distribution  of  eigenvalue  present  special  difficulties.  A  finite 
difference  discretization  on  cartesian  grids  is  used. 

Example  I:  Adaptive  Algorithm 

As  au  example  for  the  adaptive  algorithm  we  have  chosen  a  potential  which  deter¬ 
mines  a  distribution  of  eigenvalues  which  present  special  difficulties.  Not  only  closely 
clustered  eigenvalues  are  present  but  also  the  correspondence  of  the  clusters  between 
levels  is  not  appropriate,  as  explained  later. 

The  potential  for  this  case  is  V{x,y)  =  5-l-3sm(10x)  and  the  results  are  presented 
in  Tables  1  and  2.  The  first  q  =  \2  eigenvalues  were  required,  and  have  been  approx¬ 
imated  using  an  adaptive  1-FMG-V(1,1)  algorithm  which  uses  as  its  coarsest  level  a 
employs  4x4  coarsest  grid. 

The  ith  eigenvalue  and  eigenvector  will  be  denoted  next  by  A^  and  u,.  The  boxes 
in  Table  2  show  the  clusters  of  close  or  equal  eigenvalues  (with  (-)  sign)  found  by 
the  algorithm  (the  formats  are  chosen  to  outline  the  equal  digits).  As  can  be  seen 
there  the  cluster  structure  on  the  different  levels  is  not  the  same.  Particularly,  level  2 
cluster  structure  differs  from  the  level  I  cluster  structure.  Observe  that  the  cluster  of 
6  eigenvalues  on  level  1  (Ae  —  An),  with  multiplicities  1-4-1,  has  no  correspondence 
on  level  2. 

For  demonstrating  the  adaptive  flow  of  the  algorithm  we  give  a  full  history  of  one 
specific  run,  for  the  potential  discussed  here. 

The  particular  algorithm  described  in  section  3  is  used. 

The  algorithm  started  on  level  1  adding  eigenvectors  until  the  cluster  containing 
Ai2  was  completed.  Note  that  Aje,  the  last  eigenvector  that  was  found  belongs  to  the 
next  cluster,  confirming  the  completeness  of  the  Iwt  sought  cluster.  Observe  that 
on  level  1,  A^  belongs  to  a  cluster  consisting  of  two  degenerate  subspaces,  each  of 
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dimensiou  2,  and  the  eigenvalues  corresponding  to  these  degenerate  subspaces  are 
close  to  within  0(10“^)  relative  difference. 

The  relevant  eigenvectors  ui, . . . ,t;i5  were  interpolated  to  level  2  where  they  pro¬ 
vided  initial  guesses  for  the  level  2  algorithm.  Here  the  completion  of  clusters  restarted 
but  this  time  working  with  the  cluster  structure  from  level  1  and  using  two  level  cycles. 
A  test  was  done  for  the  efficiency  of  a  simultaneous  cycle  with  fine  level  projection. 
The  cycle  was  performed  to  provide  first  approximations  of  the  level  2  eigenvalues. 
The  cluster  structure  and  eigenvalues  obtained  were  compared  with  the  ones  of  level 
1.  Since  the  agreement  was  not  satisfactory,  except  for  vi,  a  cluster  completion  algo¬ 
rithm  started  with  U2-  The  completion  continued  until  the  complete  cluster  containing 
the  leist  sought  eigenvector  was  obtained,  (e.g.  for  level  2,  the  desired  uij  belongs  to 
the  cluster  uio  —  uia-  The  completion  W2is  ensured  by  the  far  value  of  A14).  Then  the 
relevant  eigenvectors  were  updated  by  one  or  a  few  cycles. 

The  solution  obtained  oh  level  2  was  interpolated  to  level  3  where  a  cluster  comple¬ 
tion  test  was  satisfied  only  by  the  first  cluster,  ui.  The  cluster  completion  algorithm 
was  applied  to  the  remaining  eigenvectors  (using  cycle  robustness  tests  and  the  cluster 
completion  tests).  This  resulted  in  few  cj'cles  per  eigenvector.  The  parameters  Ic  and 
Ip  were  found  in  the  following  way:  1)  for  cluster  1,  (uj),  the  values  were  obtained 
from  previous  level  since  this  cluster  was  stabilized  from  the  beginning;  2)  For  cluster 
2  and  3  (ue  ~  ^^9  and  uio  ~  V13  )  /c  and  Ip  were  taken  from  level  2  values  since  these 
clusters  resulted  stabilysed  after  the  cluster  completion;  3)  Robustness  tests  were 
used  for  cluster  4  since  the  eigenvalues  Aio  —  A13  on  level  L=3  and  the  next  coarse 
level  L=:2  were  not  enough  close.  Then  one  cycle  (here  V(l,l)),  wais  performed  for 
each  cluster. 

The  level  4  cluster  completion  test  was  satisfied  by  the  first  3  clusters,  eigenvalues 
vi  —  vg  and  their  parameters  were  taken  from  level  3.  The  cluster  completion  algo¬ 
rithm  was  again  applied  to  cluster  4,  (ujo  —  U13),  a  few  cycles  being  sufficient,  then 
the  parameters  were  taken  the  corresponding  level  3  ones  since  the  cluster  resulted 
stabilysed.  One  cycle  was  performed  for  each  cluster. 

The  level  5  cluster  completion  test  was  satisfied  by  all  relevant  eigenvectors  (uj  — 
U13).  One  cycle  (V(l,l)),  was  performed  for  each  cluster.  The  Ic  and  Ip  for  the  separate 
clusters,  in  the  final  cycles,  on  levels  3,  4,  5  were  found  as:  for  V\:  /^  =  /p  =  1,  for 
the  other  clusters,  containing  U2,...,t;i3  Ic  =  ip  =  2  were  obtained,  (  a  test  for  the 
asymptotic  convergence  rate,  for  cluster  Uio  —  U13  may  lead  to  1^  =  Ip  =  3,  but  such  a 
test  was  not  used  in  this  run). 

The  additional  last  eigenvector  obtained  in  the  cluster  completion  test,  used  just 
to  ensure  that  the  previous  cluster  was  complete,  was  not  needed  and  not  used  in 
further  steps.  Usually  its  convergence  was  poor  since  the  algorithm  didn’t  separate 
it  from  the  next  eigenvectors  in  its  cluster  e.g.  on  level  2,  to  separate  Ah  from  the 
next  7  eigenvectors  with  close  eigenvalues. 

Observe  that  a  second  order  approximation  and  a  good  convergence  rate  of  order 
0(10"*)  for  the  first  cycle  are  obtained.  A  simultaneous  cycle  for  all  clusters  with 
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separation  on  the  coarsest  common  level  for  all  clusters  (here  level  2  )  would  improve 
the  efficiency  of  first  cycle  but  was  not  needed.  (This  also  would  improve  the  scalar 
products  which  resulted  of  order  10"^  after  first  FMG  cycle,  in  this  ceise,  accurate 
orthogonality  being  obtained  by  the  algorithm  described  next). 

Observe  that  this  algorithm  is  of  order  0{qN)  if  one  does  not  use  fine  level  sep¬ 
aration  inside  the  clusters.  The  adaptive  coarse  level  work  on  levels  1,  2,  took  ap¬ 
proximately  1/6  of  the  computer  time  and  on  levels  1,  2,  3,  approximately  1/4  of  the 
computer  time.  This  is  a  fixed  time  and  it  would  contribute  only  with  1/16  if  level  6 
would  be  employed  too. 


able  1 

1  -FMG-V(1,1),  2-D,  16-EV,  5-Uv 
Residuals  at  Start  and  End  of  ML  Cycles 


.48E+2 


.53E+2 
.61E+2 
.66E-1-2 

5  .55E-t-2 

6  .52E-t-2 
.59E+2 

8  .6IE+2 

9  .62E+2 

10  .73E+2 

11  .S8E+2 

12  .S4E+2 

13  .51E+2 

14  .44E+2 

15  .53E+2 

16  .69E+2 


evel  2 

1  level 

3 

1 

evel  4 

1  level 

.37E-13 


.44E-13 

.38E-13 

.68E-13 

.39E-13 


.12E-12 

.31E^12 

.20&12 

.17E-12 

.13&12 

.34E-12 


.39B-12 

.70E-12 

.96E-12 

.16E^12 


.19E-06 


.60E-1  .64E-04 


.69E+0  .97E-13  .22E+0  .30E-12 


.30E4-2  .14B-12  .llE+2  .35E-12  .30E-H  .86E-03 

.30E+2  .80E-13  .llE+2  .29E-12  .30E+1  .86E-03 

.30E+2  .17E-12  .llE+2  .30E-12  .30E+1  .54E.02 

.30E+2  .24E-12  .llE+2  .45E-12  .30E+1  .54E-02 


.llE-1-3  .32E-12  .16E+2  .35E-12  .44E+1  .42E-02 

.4SE+2  .54E-11  .16E+2  .32Ei-12  .44E+1  .42E-02 

.4SE-(-2  .S7E-11  .16E-1-2  .4tE-12  .44E+1  .39E-02 

.45E+2  .71E-11  .16E+2  .33E-12  .44E+1  .16Ei-02 


.4SE-(-2  .15E-09  .12E+3  .30E-09  .43E+2  .72E-08 

.llE+3  .41Ei-09  .12E+3  .16E-09  .43E+2  .20E.08 

.12E+3  .81E-11  .12E+3  .26E-09  .43E+2  .21E^05 

.12E+3  .SOE'04  .12E-)-3  .50E-09  .43E+2  .16E-05 


.12E+3  .19E-05  I  .12E+3  .17B-06 
.12E+3  .5SE+01 


.15E-1 


.76E+0 

.76E+0 

.76E+0 

.76E+0 


•llE+l 

•llE+l 

•llE+l 

•llE+l 


.12E-I-2 

.12E4-2 

.12E-f2 

.12E+2 
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.14E-04 


.73E-04 

.73E-04 

.llE-02 

.llE-02 


.82E-03 

.82E-03 

.83E-03 

.93E-03 


.33E-02 

.33E-02 

.29E-01 

.29E-01 


level  1 


.496347395806E+1 


.860204208719E+2 
.860204208719E+2 
.8605694691 39E+2 
.860569469139E+2 


.1670  E+3 
.167113893828E+3 
.167113893828E+3 
.167113893828E+3 

10  .16711389.3828E+3 

11  .16715  E-t-3 

12  .248170840742E+3 

13  .248170840742E+3 

14  .248207366784E+3 

15  .248207366784E-I-3 

16  .329264313697E+3 


1  -FMG-V{t,l),  2-D,  16-EV,  5-LEV 
Eigenvalues  at  End  of  ML  Cycles 


iimi 


.495721389176E+1  I  .495552134150E-I-1  1  .49550931 7773E-(-l 


.99921 3342469E-I-2  .103677004418E-I-3  .104634633842E-I-3 

.999313342469E-I-2  .I036770044I8E-I-3  .104634633842E-I-3 

.9995  E-)-2  .10371  E-l-3  .1046  E-l-3 

.99998  E-H2  .10375  E-(-3  .1047  E-f3 


.194919376181E-I-3 

.19491937618lE-f-3 

.194962161804E-(-3 

.194962161804E-<-3 


.1046  E-l-3 
.1047  E-f3 


.204351 758395E-I-3 
.204351 758395E-I-3 
.204395  E-l-3 
.204396  E-l-3 


.3291 85001 S47E+3  .38481 2002762E-I-3  .399841022256E-I-3 

.3291 85001 547E-I-3  .38481 3002762E-I-3  .399841022256E-I-3 

.329227787655E-I-3  .3848590736  E-l-3  .399888846  E-l-3 

.329227787656E-1-3  .38485907.39  E-|-3  .399888846  E-l-3 


.42419190801  lE-l-3  |  .483580557031  E-l-3  1  .499567983067E-f3 
.424395844705E-I-3 


evel  5 


.495498173425E-I-1 


.10487469501 2E-I-3 
.1 0487469501 2E-I-3 
.10491  E-f3 
.10495  E-l-3 


.204831 900326E-I-3 
.204831 900326E-I-3 
.20487691 8643E-I-3 
.304876918643E-I-3 


.4036731 03803E-I-3 
.403673103808E-I-3 
.403720980600E-I-3 
.403720980600E-I-3 


Example  II:  Fine  level  Separation 

In  the  next  exanmple  the  potential  V{xyy)  =  5  +  3sin(10i)  +  2cos(10t/)  causes 
a  further  spliting  of  the  eigenvalues.  The  clusters  were  treated  sequencially  and  the 
projection  for  each  cluster  was  performed  on  the  finest  level  to  provide  accurate  finest 
level  separation  inside  clusters.  The  results,  for  9  eigenvectors,  are  presented  in  Tables 
3  and  4.  A  10-FMG-V(1,1)  algorithm  was  used  to  show  the  constant  convergence  rate 
per  cycle.  The  coarsest  relaxation  level  for  clusters  2  and  3  was  level  2  and  for  the 
first  eigenvector  wais  level  1 .  On  levels  1  and  2  the  adaptive  algorithm  and  few  cycles 
were  used.  All  eigenvectors  came  out  accurately  orthogonal  scalar  products 

on  level  4). 

This  shows  that  it  is  enough  to  perform  separation  only  within  clusters. 


Table  3 

10  -FMG-V(I,1).  2-D,  9-EV,  4-Lev 
Residual!  at  Start  and  End  of  ML  Cyc 

es 

Evect. 

level  1 

level  2 

level  3 

level  4  II 

1 

0.48E+02 

0.14E:-13 

0.83E+00 

0.12E-12 

0.27E-i-00 

o.iiE-n 

0.72E-01 

0.41E-11 

0.46E+02 

0.83E-09 

0.30E+02 

0.48E-09 

0.1lE-f02 

0.42E-12 

0.30E4-01 

0.21E-11 

0.52E+02 

0.29E-09 

0.30E-i-02 

0.12E-08 

O.llE-l-02 

0.75E-12 

0.30E4-01 

0.52E-11 

0.56E+02 

0.S6E-I0 

0.30E-1-02 

0.73E-09 

0.nE-)-02 

0.93E-12 

0.30E-I-01 

0.56E-11 

0.54E+02 

0.85E-09 

0.30E-f02 

0.S5E-08 

0.11E-H02 

0.17E-n 

0.30E-t-01 

0.12E-10 

0.53E+02 

0.57Ei-02 

O.nE-i-03 

0.40E-05 

0.16E+02 

0.61  B-1 2 

0.44E-)-01 

0.16E-10 

0.53E+02 

0.68&n 

0.45E-f02 

0.57E-05 

0.16E-f-02 

O.lOE-11 

0.44E-(-01 

0.39E-10 

8 

0.41  E+02 

0.13E-10 

0.45E-f02 

0.29E-05 

0.16E+02 

0.82E-12 

0.44E-I-01 

0.33E-10 

9 

0.43E-(-02 

0.80E-02 

0.nE-H03 

0.t4E-05 

0.16E+02 

0.83E-12 

0.44E+01 

0.48E-10 

Table  4 

10  -FMG-V(1,1),  2-D,  9-EV,  4-LEV 

Eigenvalues  at  End  of  ML  Cycles 

\tm 

level  1 

level  2 

level  3 

level  4  1 

Bi 

-0.860202443918E+02 

-0.9991  E-f02 

-0.10367E-i-03 

-0.10463E-I-03 

Q 

-0.860202443918E-)-02 

-0.999361  E-f02 

-0.1036931  E-(-03 

-0.104650E-t-03 

II 

-0.860406326305E-^02 

-0.999362E-»-02 

-0.1036937E-f03 

-0.104651E-I-03 

a 

-0.860406326305E-f02 

-0.9997E+02 

-0.1037E-f03 

-0.10469E-I-03 

6 

-0.1670E-»-03 

-0.19491  E-t-03 

-0.20243E4-03 

-0.20434E-i-03 

7 

-0.1671 13893828E-i-03 

-0.19493E-f03 

-0.20245-1-03 

-0.20436E-f-03 

8 

-0.1671 13893828E-i-03 

-0.19495E-1-03 

-0.20247E-I-03 

-0.20439E-f03 

9 

-0.16713E-f03 

-0.19497E-i-03 

-0.20249E-I-03 

-0.2044E-f03 

Example  III:  Coarse  Level  Separation,  2D  and  3D 

In  the  next  two  runs  (Tables  5,  6)  we  show  that  a  coarsest  level  separation  {Ic  = 
//  =  1),  even  for  clusters  containing  very  close  and  degenerate  eigenvalues,  can  be 
enough  to  provide  accurate  finest  level  separation.  In  the  degenerate  clusters  the 
eigenvectors  were  not  orthogonalized  on  finest  levels  but  resulted  so  from  the  FMG, 
where  orthogonality  was  imposed  on  the  coarsest  level  solution  (in  the  FMG,  not 
during  the  cycles).  This  implies  an  0{qN)  algorithm  even  for  close  clustered  cases. 

Table  5  shows  results  for  a  problem  in  2-D  with  a  potential  V{x,y)  =  2  + 
0.1sm(10z  +  lOy),  which  produces  a  splitting  of  the  first  cluster  of  four  eigenval- 
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ues  into  two  degenerate  clusters  whose  eigenvalues  are  close  to  within  10“^.  A  second 
order  approximation  was  obtained  by  an  1-FMG-V(1,1)  algorithm  and  the  asymp¬ 
totic  convergence  rate  per  fine  level  cycle  was  1/10.  Observe  the  13  equal  digits  of  the 
degenerate  eigenvalues,  on  all  levels.  On  level  5,  8  cycles  were  performed  to  show  the 
constant  convergence  rate  per  cycle,  (see  cycles  3  and  8  where  the  convergence  rate 
is  accurately  1/10).  The  eigenvectors  came  out  accurately  orthogonal,  even  in  the 
degenerate  eigenspaces,  although  the  projection  was  performed  only  on  the  coarsest 
level,  (the  eigenvector’s  scalar  products  being  of  order  10“'^  on  level  5). 

The  same  efficiency  is  obtained  for  problems  in  3-D  as  can  be  seen  in  Table  6. 
The  potential  V{x,y,z)  =  2  -f  stn(20x  -f  lOy  —  IO2)  determines  a  cluster  of  six  close 
eigenvalues  grouped  into  two  clusters  of  two  and  four  equal  eigenvalues  (with  13 
digits).  Observe  the  1/10  convergence  rate  in  cycle  3  and  the  first  6  common  digits 
of  the  eigenvalues  in  the  cluster. 

These  examples  indicate  that  the  reduction  of  complexity  of  the  newly  developed 
algorithm  from  0{q^N)  to  0{qN)  is  quite  general  and  also  works  for  rather  difficult 
cases. 


Table  S 

1  -FMG-V(1,1),  2-D,  5-EV,  S-LEV 


vector  nnt  res  last  res 


0.18&02 

0.30E4-01 

0.30E+01 

0.30E+01 

0.30E+01 


0.46E-03 

0.76E+00 

0.76E+00 

0.76E+00 

0.76E+00 


0.35B-0S 

0.27E-04 

0.27E-04 

0.27E-04 

0.27E-04 


0.97E-11 

0.29E-09 

0.29E-09 

0.29E-09 

0.29E-09 


0.13E-03 

0.43E-02 

0.43E-02 

0.43E-02 

0.43E-02 


5 


0.36E-04 

0.40E-03 

0.40E-03 

0.40E-03 

0.40E-03 


0.26E-10 

0.31E-10 

0.31E-10 

0.31E-10 

0.31E-10 


0.1 999975244971 5E+01 
0.10162979203934E+03 
0.10162979203934E-f-03 
0.101 72831 140738E+03 
0.10172931140738E+03 


-0.19999750026388E-t-01 
-0.10186970728937E-I-03 
-0.1 01 86970728937E+03 
-0.10196970729590E-f03 
-0.1019697D729590E-h03 


-0.19999749801 202E-I-01 
-0.10186970049930E-(-03 
-0.1018697D049930E-I-03 
-0. 1 01 96970049780E-H» 
-0.1 01 96970049780E-4-03 


-0.1 99997497991 43E-I-01 
-0.10186970048459E-I-03 
-0.10186970048459E-I-03 
-0.10196970048303E-I-03 
-0.10196970048302E-I-03 


Table  6 

1  -FMG-V(1,1),  3-D,  T-EV,  3-LEV. 


igmMiM'.'-i.iwi 

3 

1 

Kin  il  ■ 

3 

0.2SE-02 

3 

0.35E-01 

0.26E-02 

4 

0.23E-01 

-0.100720682691 98E+03 

S 

0.23E-01 

0.31  E-02 

-0.10072068269198B-I-03 

6 

0.23E-01 

0.21B-02 

-0.10072068369198E+03 

7 

0.23B-01 

0.31  B-02 

-0.10072068269198B+03 

5  Conclusions 

A  robust  and  efficient  ML  -algorithm  to  compute  a  few  eigenvectors  and  the  corre¬ 
sponding  eigenvalues  for  large  scale  eigenvalue  problems  has  been  developed.  The 
algorithm’s  robustness  results  from  the  adaptive  completion  and  treatment  of  clus¬ 
ters,  the  simultaneous  treatment  of  solutions  in  each  cluster,  and  from  tests  which 
monitor  the  algorithm’s  convergence  and  efficiency.  The  algorithm  treats  central  dif-  . 
Acuities  such  as:  the  poor  solution  representation,  on  coarse  levels,  the  existence  of 
clustered  eigenvalues,  the  approximation  of  incomplete  clusters,  and  the  mixing  of 
approximated  eigenvectors  during  the  solution  process.  Its  eigenvector  separation  ef¬ 
ficiency  stems  from  a  new  ML  projection  technique  which  is  a  generalization  of  the 
Rayleigh  Ritz  projection,  combined  with  backrotations. 

In  the  cases  when  the  algorithm  properly  separates  the  eigenvectors  on  coarse 
levels,  the  algorithm’s  complexity  is  of  0{qN)  for  q  eigenvectors  of  size  N  on  the 
finest  level.  The  numerical  tests  showed  that  an  accurate  fine  level  separation  was 
obtained  by  the  coarse  level  projection,  even  for  problems  with  very  close  or  equal 
eigenvalues. 

The  results  of  the  numerical  tests  for  Schrodinger  eigenvalue  problems,  show  that 
the  algorithm  achieved  the  same  accuraw:y,  using  the  same  amount  of  work  (per  eigen¬ 
vector),  as  the  Poisson  multigrid  solver.  A  second  order  approximation  is  obtained 
using  the  5-point  in  2-D  and  7-point  in  3-D  discretized  Laplaceian,  by  1-FMG-V(1,1) 
in  0{qN)  work.  This  means  that  the  work  was  of  order  a  few  (2-4)  fine  level  re¬ 
laxations  per  eigenvector.  The  adaptive  work  weis  only  part  of  the  fine  level  work 
and  enhanced  the  fine  level  cycle  efficiency.  Constant  convergence  rate  per  cycle  v/as 
obtained  for  the  presented  cases.  The  robustness  of  the  algorithm  has  been  demon¬ 
strated  on  problems  with  eigenvalue  distribution  that  present  special  difficulties. 
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each  cluster,  and  the  robustness  tests  which  monitor  the  algorithm’s  efficiency  and  convergence.  The  eigenvectors’  separation 
efficiency  is  based  on  a  new  ML  projection  technique  generalizing  the  Rayleigh  Ritz  projection,  combined  with  a  novel 
technique,  the  backrotations.  These  separation  techniques,  when  combined  with  an  FMG  formulation,  in  many  cases  lead 
to  algorithms  of  0{gN)  complexity,  for  q  eigenvectors  of  size  N  on  the  finest  level.  Previously  developed  ML  algorithms  are 
less  focused  on  the  mentioned  difficulties.  Moreover,  algorithms  which  employ  fine  level  separation  techniques  are  of  0(q^ff) 
complexity  and  usually  do  not  overcome  all  these  difficulties.  Computational  examples  are  presented  where  Schrodinger  type 
eigenvalue  problems  in  2-D  and  3-D,  having  equal  and  closely  clustered  eigenvalues,  are  solved  with  the  efficiency  of  the  Poisson 
moltigrid  solver.  A  second  order  approximation  is  obtained  in  0{qN)  work,  where  the  total  computational  work  is  equivalent 
to  only  a  few  fine  level  relaxations  pet  eigenvector. 


15.  NUMBER  OF  PAGES 
20 


16.  PRICE  CODE 

A03 


Standard  Form  29B  (Rev  2-89) 

bmcnbwl  by  ANSI  tM  rJS-ll 

rss-is] 


14.  SUBJEa  TERMS 

leading  eigenvalues. 

multilevel  algorithms 

17.  SECURITY  CLASSIFICATION 

18.  SECURITY  CLASSIFICATION 

19.  SECURITY  CLASSIFICATION 

OF  REPORT 

OF  THIS  PAGE 

OF  ABSTRACT 

Unclassified 

Unclassified 

NSN  7540-01 -280-S500 


Iku.t.  GOVERNMENT  PRINnNC  OFTICE:  Nfl  -  72SOt4/a*eM 


